Inverse problems are paramount in Science and Engineering. In this paper, we consider the setup of Statistical Inverse Problem (SIP) and demonstrate how Stochastic Gradient Descent (SGD) algorithms can be used in the linear SIP setting. We provide consistency and finite sample bounds for the excess risk. We also propose a modification for the SGD algorithm where we leverage machine learning methods to smooth the stochastic gradients and improve empirical performance. We exemplify the algorithm in a setting of great interest nowadays: the Functional Linear Regression model. In this case we consider a synthetic data example and examples with a real data classification problem.
translated by 谷歌翻译
我们通过反馈信息研究了离线和在线上下文优化的问题,而不是观察损失,我们会在事后观察到最佳的动作,而是对目标功能充分了解的甲骨文。我们的目标是最大程度地减少遗憾,这被定义为我们的损失与全知的甲骨所产生的损失之间的区别。在离线设置中,决策者可以从过去段中获得信息,并且需要做出一个决策,而在在线环境中,决策者在每个时期内都会动态地基于一组新的可行动作和上下文功能,以动态进行决策。 。对于离线设置,我们表征了最佳的最小策略,确定可以实现的性能,这是数据引起的信息的基础几何形状的函数。在在线环境中,我们利用这种几何表征来优化累积遗憾。我们开发了一种算法,该算法在时间范围内产生了对数的第一个遗憾。
translated by 谷歌翻译
The monograph summarizes and analyzes the current state of development of computer and mathematical simulation and modeling, the automation of management processes, the use of information technologies in education, the design of information systems and software complexes, the development of computer telecommunication networks and technologies most areas that are united by the term Industry 4.0
translated by 谷歌翻译
Can we leverage the audiovisual information already present in video to improve self-supervised representation learning? To answer this question, we study various pretraining architectures and objectives within the masked autoencoding framework, motivated by the success of similar methods in natural language and image understanding. We show that we can achieve significant improvements on audiovisual downstream classification tasks, surpassing the state-of-the-art on VGGSound and AudioSet. Furthermore, we can leverage our audiovisual pretraining scheme for multiple unimodal downstream tasks using a single audiovisual pretrained model. We additionally demonstrate the transferability of our representations, achieving state-of-the-art audiovisual results on Epic Kitchens without pretraining specifically for this dataset.
translated by 谷歌翻译
This paper is a technical overview of DeepMind and Google's recent work on reinforcement learning for controlling commercial cooling systems. Building on expertise that began with cooling Google's data centers more efficiently, we recently conducted live experiments on two real-world facilities in partnership with Trane Technologies, a building management system provider. These live experiments had a variety of challenges in areas such as evaluation, learning from offline data, and constraint satisfaction. Our paper describes these challenges in the hope that awareness of them will benefit future applied RL work. We also describe the way we adapted our RL system to deal with these challenges, resulting in energy savings of approximately 9% and 13% respectively at the two live experiment sites.
translated by 谷歌翻译
Obstacles on the sidewalk often block the path, limiting passage and resulting in frustration and wasted time, especially for citizens and visitors who use assistive devices (wheelchairs, walkers, strollers, canes, etc). To enable equal participation and use of the city, all citizens should be able to perform and complete their daily activities in a similar amount of time and effort. Therefore, we aim to offer accessibility information regarding sidewalks, so that citizens can better plan their routes, and to help city officials identify the location of bottlenecks and act on them. In this paper we propose a novel pipeline to estimate obstacle-free sidewalk widths based on 3D point cloud data of the city of Amsterdam, as the first step to offer a more complete set of information regarding sidewalk accessibility.
translated by 谷歌翻译
已经开发了增强学习(RL)技术来优化工业冷却系统,与传统的启发式政策相比,提供了可观的节能。工业控制中的一个主要挑战涉及由于机械限制而在现实世界中可行的学习行为。例如,某些操作只能每隔几个小时执行一次,而其他动作可以更频繁地采取。如果没有广泛的奖励工程和实验,RL代理可能无法学习机械的现实操作。为了解决这个问题,我们使用层次结构的增强学习与多种根据操作时间尺度控制动作子集的代理。我们的分层方法可以在现有基线上节省能源,同时在模拟的HVAC控制环境中保持在安全范围内的限制(例如操作冷却器)。
translated by 谷歌翻译
我们提出了一个混合工业冷却系统模型,该模型将分析解决方案嵌入多物理模拟中。该模型设计用于增强学习(RL)应用程序,并平衡简单性与模拟保真度和解释性。该模型的忠诚度根据大规模冷却系统的现实世界数据进行了评估。接下来是一个案例研究,说明如何将模型用于RL研究。为此,我们开发了一个工业任务套件,该套件允许指定不同的问题设置和复杂性水平,并使用它来评估不同RL算法的性能。
translated by 谷歌翻译
在机器学习研究社区中,关于模型复杂性与所需数据和计算能力的关系之间的关系存在共识。在现实世界应用中,这些计算要求并非总是可用的,激发了对正则化方法的研究。此外,当前和过去的研究表明,更简单的分类算法可以在计算机视觉任务上达到最先进的性能,并给定一种强大的方法来人为地增强培训数据集。因此,近年来,数据增强技术成为流行的研究主题。但是,现有的数据增强方法通常不如其他正则化方法传递。在本文中,我们确定了数据增强算法应用的主要领域,所使用的算法,重要的研究趋势,随着时间的推移的发展以及数据增强文献中的研究差距。为此,相关文献是通过Scopus数据库收集的。它的分析是在网络科学,文本挖掘和探索性分析方法之后进行的。我们希望读者能够了解数据扩展的潜力,并在数据增强研究中确定未来的研究方向和开放问题。
translated by 谷歌翻译
这项工作介绍了一个新颖的知识蒸馏框架,用于分类任务,其中可用并考虑到现有子类信息。在具有少量类或二进制检测的分类任务中,从教师到学生的信息量受到限制,从而限制了知识蒸馏的效用。通过利用类中可能的子类信息可以提高性能。为此,我们提出了所谓的子类知识蒸馏(SKD),这是将预测子类知识从老师转移到较小学生的过程。在老师的课堂逻辑中不存在的有意义的信息,而是在子类徽标中存在(例如,课堂内的相似之处)将通过SKD传达给学生,然后将提高学生的表现。从分析上,我们衡量教师可以通过SKD向学生提供多少额外信息,以证明我们工作的功效。开发的框架是在临床应用中评估的,即结直肠息肉分类。这是两个类别和每个类的许多子类的实际问题。在此应用程序中,使用临床医生提供的注释来根据注释标签的学习方式来定义子类。接受SKD框架训练的轻巧,低复杂的学生的F1得分为85.05%,提高了1.47%,比学生分别接受和没有常规知识蒸馏的学生获得了2.10%的收益。接受和没有SKD的学生之间的2.10%的F1得分差距可以通过额外的子类知识来解释,即,每个样本的额外的0.4656标签位可以在我们的实验中转移。
translated by 谷歌翻译